Approximation Measures for Conditional Functional Dependencies Using Stripped Conditional Partitions
نویسندگان
چکیده
Received Apr 11, 2017 Revised May 5, 2017 Accepted May 24, 2017 Conditional functional dependencies (CFDs) have been used to improve the quality of data, including detecting and repairing data inconsistencies. Approximation measures have significant importance for data dependencies in data mining. To adapt to exceptions in real data, the measures are used to relax the strictness of CFDs for more generalized dependencies, called approximate conditional functional dependencies (ACFDs). This paper analyzes the weaknesses of dependency degree, confidence and conviction measures for general CFDs (constant and variable CFDs). A new measure for general CFDs based on incomplete knowledge granularity is proposed to measure the approximation of these dependencies as well as the distribution of data tuples into the conditional equivalence classes. Finally, the effectiveness of stripped conditional partitions and this new measure are evaluated on synthetic and real data sets. These results are important to the study of theory of approximation dependencies and improvement of discovery algorithms of CFDs and ACFDs. Keyword:
منابع مشابه
Tsallis Entropy and Conditional Tsallis Entropy of Fuzzy Partitions
The purpose of this study is to define the concepts of Tsallis entropy and conditional Tsallis entropy of fuzzy partitions and to obtain some results concerning this kind entropy. We show that the Tsallis entropy of fuzzy partitions has the subadditivity and concavity properties. We study this information measure under the refinement and zero mode subset relations. We check the chain rules for ...
متن کاملOptimal Portfolio Selection for Tehran Stock Exchange Using Conditional, Partitioned and Worst-case Value at Risk Measures
This paper presents an optimal portfolio selection approach based on value at risk (VaR), conditional value at risk (CVaR), worst-case value at risk (WVaR) and partitioned value at risk (PVaR) measures as well as calculating these risk measures. Mathematical solution methods for solving these optimization problems are inadequate and very complex for a portfolio with high number of assets. For t...
متن کاملDiscover Dependencies from Data - A Review
Functional and inclusion dependency discovery is important to knowledge discovery, database semantics analysis, database design, and data quality assessment. Motivated by the importance of dependency discovery, this paper reviews the methods for functional dependency, conditional functional dependency, approximate functional dependency and inclusion dependency discovery in relational databases ...
متن کاملEvaluation of Model-Based Methods in Estimating Dynamic Functional Connectivity of Brain Regions
Today, neuroscientists are interested in discovering human brain functions through brain networks. In this regard, the evaluation of dynamic changes in functional connectivity of the brain regions by using functional magnetic resonance imaging data has attracted their attention. In this paper, we focus on two model-based approaches, called the exponential weighted moving average model and the d...
متن کاملAutomatic Discovery of Functional Dependencies and Conditional Functional Dependencies: A Comparative Study
Over the last twenty years, several algorithms have been proposed for automatic rule/constraint discovery from data, for the purpose of data cleaning. These algorithms look for constraints such as functional dependencies (FDs), conditional FDs (CFDs), inclusion dependencies (INDs), conditional INDs (CINDs), association rules, integrity constraints (ICs) and denial constraints (DCs), among other...
متن کامل